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The density classification problem is one of the simplest yet non¬ 
trivial computing tasks which seem to be ideally suitable for cel¬ 
lular automata (CA). Unfortunately, there exists no one-dimensional 
two-state CA which classifies binary strings according to their 
densities. If, however, in place of simple cells one uses agents 
which change their behaviour from one rule to another after a 
fixed number of iterations, the classification can be performed 
by the traffic rule 184 and the majority rule 232. This two-rule 
solution cannot be easily generalized to two (or higher) dimen¬ 
sions, because it critically depends on a kinetic phase transition 
occurring in the rule 184. No rule exhibiting analogous transition 
is known in two dimensions, most likely because no such rule 
exists. We propose, therefore, to approach this problem form a 
slightly different angle, namely by introducing a stochastic com¬ 
ponent into each of the two rules. If one precedes each iteration 
of rule 184 by the stochastic “lane changing rule”, and each it¬ 
eration of rule 232 by the stochastic “crowd avoidance” rule, in 
the limit of infinitely many iterations the classification can be 
performed correctly with probability 1. This solution can be de¬ 
scribed either in the language of CA, or using the paradigm of 
agents which move and proliferate on the 2D lattice, following 
probabilistic rules. 
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1 INTRODUCTION 


The density classification problem (DCP) asks for a construction of a cellular 
automaton rule which, when applied to a binary string of density p, converges 
to all ones when p > 1/2 and all zeros when p < 1/2. From the time when 
Gacs, Kurdyumov and Levin proposed this problem and its first approximate 
solution m, a lot of research effort went into studying of this topic. After it 
has been proved by Land and Belew im that the perfect two-state rule per¬ 
forming this task does not exist, approximate solutions have been constructed 
using a variety of methods. The best currently known solutions, found by evo¬ 
lutionary algorithms, are described in ifT^ . For a good review of the density 
classification problem and its various modifications the reader may consult 
131 and references therein. 

Here we shall concentrate on a two-dimensional version of DCP. For an 
infinite two-dimensional array, the so-called Toom’s rule is know to be a so¬ 
lution 121, but its performance on finite lattices is unsatisfactory 0. Approx¬ 
imate solutions do exist, and similarly as in the one-dimensional case, they 
have been obtained by evolutionary algorithms iniiii. 

Since no single rule solving DCP in one dimension exists, the author pro¬ 
posed a generalized version of the problem involving two rules and found 
its exact solution 0. Later, other two-rule solutions have been found as 
well IT^ . The purpose of this paper is to explore a generalization of the two- 
rule solution to two dimensions. As we will shortly see, although a direct 
and naive generalization does not work, it can be fixed by adding stochastic 
components to both rules. 

2 SOLUTION IN ONE DIMENSION 

In one dimension, in order to classify a binary string of length L, one needs 
to apply rule 184 for a number of time steps, and then switch to rule 232 . To 
be more precise, let us define s = {sq, si,..., Si_i} to be a string of binary 
numbers. We impose periodic boundary conditions on it, so that all indices 
are taken modulo L. Density of s is defined as p = J2i=o ^i/^- We further 
define i?i 84 : {0,1}^ —>■ {0,1}^ as 

[.^184(s)]i — 1 (1) 

and i ?232 : {0,1}^ ^ {0,1}^ as 

[^ 232 (s)]i = majority Si, Sj+i}. (2) 
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In m, the following proposition is proved. 

Proposition 1 Let s be a binary string of length L and density p, and let 
n = [(L — 2)/2j, m = [(L — 1)/2J. Then f?^2^i84('5) consists of only 
O’s if p < 1/2 and of only I’s if p > 1/2. If p = 1/2, .R^ 2 ^i 84 ('®) 
alternating sequence of O’s and 1 ’s, i.e., . . . 01010101.... 

What makes this scheme work is a specific property of rule 184. First of 
all, this rule is known to be number-conserving, that is, it does not change 
the number of zeros or ones in the string to which it is applied. Moreover, if 
the initial string has more zeros than ones, after sufficiently many iterations 
of rule 184 all pairs 00 disappear. Conversely, if there is more zeros than 
ones in the initial string, all pairs 11 eventually disappear. Rule 232, on the 
other hand, has the property of growing continuous clusters of zeros in the 
absence of 11 pairs, and growing continuous clusters of ones in the absence 
of 00 pairs. 

In the light of the above, it is not hard to see that the combination of rules 
184 and 232 performs perfect density classification, as described in Proposi¬ 
tion 1. Could the same scheme be applied to two-dimensional binary arrays? 

Before we answer this question, let us remark that rule 184, being number- 
conserving, can be described using the paradigm of particles or agents, if one 
assumes that a site in state 1 corresponds to a cell occupied by an agent, and 
site in state 0 denotes empty cell. One can then show Q that rule 184 is 
equivalent to the following behaviour of agents: if an agent has an empty cell 
on the right hand side, it moves there, otherwise it does not move. All agents 
move at the same time. Since this is the simplest model of road traffic with 
agents being cars, rule 184 is also known as the “traffic rule”. It is obvious 
that the number of agents in this rule will always be preserved. 

The second rule, rule 232, can also be described in terms of agents, as fol¬ 
lows. If an agent has two empty neighbouring cells, it disappears, otherwise 
it remains. Moreover, if an empty cell is surrounded by two agents, a new 
agent is bom there. Birth and deaths of agents happen simultaneously on the 
entire lattice. 

The density classification problem can be now rephrased in the language 
of agents. We start with a lattice where a certain unknown number of cells is 
occupied by agents. We want to equip the agents with a local rule governing 
their behaviour such that if more than 50% of initial cells are occupied, in the 
end all cells are occupied. If less than 50% of cells are occupied by agents, 
then in the end all cells become empty. Proposition 1 tells us that if we let the 
agents move following the traffic rule for [(L — 2)/2j steps and then we let 
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them proliferate or die following rule 232 for [(L — 1)/2J time steps, in the 
end we will obtain the desired configuration. 


3 RULES 184 AND 232 IN TWO DIMENSIONS 

Let us now consider binary arrays L x L, with entries Sij, where i,j G 
{0,1,... L — 1}. Set of all such arrays will be denoted by S. As before, we 
will impose periodic boundary conditions, taking all indices i,j modulo L. 

A simple and naive generalization of Proposition 1 could involve rules 
184 and 232 “lifted” to two dimensions, by defining i?i 84 : S S and 
^232 : 5 -> iS as 


[-R232(s)]ij = majority 


(3) 

(4) 


Unfortunately, one cannot classify densities by applying i?i 84 for a number 
of steps and then switching to rule 232, as it was done in one dimension. If 
we apply i?i 84 to a two-dimensional array, each row will remain independent 
of all other rows. In particular, it may happen that in some rows all 00 pairs 
are eliminated, and in other rows all 11 pairs are eliminated. As a result, 
both 00 and 11 will still be present when one switches to rule 232, leading 
to (possibly) incorrect classification. One needs to introduce some sort of 
interaction between rows which would allow for transfer of 00 and 11 pairs 
between rows. 

The author experimented with various possibilities of inter-row interac¬ 
tions, but all of them turned out to be unsatisfactory. It seems that there 
exists no deterministic CA rule in two dimensions which would have prop¬ 
erties analogous to rule 184 in ID. However, having in mind recent progress 
on one-dimensional DCP using probabilistic rules illli, injection of some 
stochasticity into dynamics presents itself as a promising possibility. Indeed, 
as it turns out, if one allows probabilistic CA, two-dimensional analog of rule 
184 can be constructed rather easily. 

4 RANDOMIZATION OE RULE 184 

Let us first describe a rule which will be called “lane changing rule”, illus¬ 
trated in Figure [T] We will call this rule Fx (the meaning of the subscript 
X will be explained later). According to the rule Fx, agents move simulta¬ 
neously in such a way that every agent which has another agent on the right 
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★ 0 * 

Jp = 0.5 

*11 


FIGURE 1 

Particle (agent) representation of rule Fx ■ 


hand side and empty site above, moves up with probability 1/2 and stays in 
the same place with probability 1/2. All other agents do not move. If agents 
were cars moving to the right, one could say that cars which are blocked (have 
another car directly ahead) change lane if possible (move to the next row) - 
hence the name “lane changing rule”. 

Rule Fx can of course be described formally using the language of cellular 
automata. Let X denote an array of independent and identically distributed 
random variables Xij, where /, j € {0,1 ,... L — 1}, such that 

Pr(X,,, = 0) = Pr(X,,, = 0) = ^. (5) 

Furthermore, let us define 

( Si-1,j+l Si,j + 1 Si+I,j+1 \ 

Si—i,j Si,j Si-\-i,j I . (6) 

Si-1,j-1 Si,j-1 St+I,j-1 ) 

Using the above notation, we define Fx : 5 —> 5 as 




1 


< 


0 


Si, 


( ■k -k -k 
* 0 * 
* 1 1 
/ k 0 k 
if Uij =*11 
y * * * 

otherwise. 


and Xij-i = 1, 


and Xij = 1, 


(7) 
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This can be written in a more compact form as 


— Sij + (1 


Using the above, one can compute the sum of array’s entries after the appli¬ 
cation of Fx, 




(9) 


If one replaces j by j +1 in the second sum on the right hand side, it becomes 
the same as the third sum, meaning that the second and the third sum cancel 
each other. This leaves 



( 10 ) 


proving that F is number-conserving. 

The effect of application of Fx is a “diffusion” of pairs 11 between rows. 
Figure [^illustrates this using an initial array with two rows only (Figure]^). 
In the top row, there are two 00 pairs, and in the bottom row, two 11 pairs. If 
one applied Ris 4 to this configuration, pairs 00 and 11 would change position 
in their respective rows, but they would remain in the same row. Multiple 
iterations of i?i 84 would have a similar effect - pairs 00 and 11 would never 
be eliminated. Now, suppose that we apply Fx first. A possible outcome of 
this is a configuration shown in Figure]^. One of the agents from the lower 
row (the one located at 7th position from the right) jumped up to the upper 
row. Now, in the upper row we have one 00 pair and one 11 pair, and in the 
bottom row neither one of them. If one now applies i?ig 4 , 00 and 11 in the 
top row will “annihilate” each other, and the resulting configuration will be 
as shown in Figure]^. 

This means that instead of i?i 84 , we need to use FxRi%a- Rule FxRisa 
is a probabilistic CA rule, and by combining eqs. and (j^, one can obtain 
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FIGURE 2 

Illustration of properties of rule Fx'- (a) initial configuration s, (b) Fx{s), (c) 
Ri8iFx{s) 


its explicit definition, 

RxR i84:(^^} — ^i—lj “F 

+ (1 ~ Si—ij- — SijSi-^-ij + Si—ijSij) 

X + Sij-_i — Si-ij-iSij-i) 

X + Si+l,j-lSi+2,j-l — Sij_iSi+ij_i) Xij-i 

~ (Si —l,j + SijSi+ij- — Si_ijSij) 

X (1 — Si_i_j + 1 — Sij + iSi+ij+i + Si_ij-i-iSij + i) 

X (Sij' “F Si_|_i jSi^2,j Xi j. (11) 

Note that [Fxi?i 84 (s)]ij depends not only on nearest neighbours of Sij, but 
also on second nearest neighbours. FxRiba is thus a probabilistic CA rule 
with Moore neighbourhood of range 2. 

After sufficiently many iterations of this rule, the binary array will have no 
00 pairs (if it had more zeros than ones at the beginning), or no 11 pairs (if it 
had more ones than zeros at the beginning). This is precisely what is needed 
for rule 232 to do its job. 
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FIGURE 3 

Particle (agent) representation of rule Gx ■ 


5 RANDOMIZATION OF RULE 232 


Once the pairs 00 or 11 are eliminated (depending on the initial density), we 
need to grow clusters of zeros or ones, in order to reach the final configuration 
of all all zeros or all ones. 

If, after multiple iterations of FxRi 84 , every row included some 00 pairs 
(or some 11 pairs), iterating R 232 would produce the desired final configura¬ 
tion of all zeros (or all ones). However, it is entirely possible that after iter¬ 
ations of FxRi 8 a we will obtain some rows which are made of alternating 
zeros and ones. ... 010101.... Such rows after one iteration of rule 232 be¬ 
come ... 101010 ..., and after another iteration become again ... 010101 ..., 
thus they are periodic points of rule 232 (with period 2). This means that the 
desired final configuration of all zeros or all ones may never be reached if the 
“plain” i ?232 is iterated. We need a way to move 00 and 11 pairs between 
rows, just like before. 

This will be done using a probabilistic rule Gx, slightly different that 
previously defined Fx- Using again the agent paradigm, we will construct a 
rule in which an agent having 000 above itself will jump up with probability 
50% or stay in the same place with probability 50%. Similarly, an empty 
site which has 111 below will become occupied with probability 50% (the 
central agent from 111 below will jump to it), and will remain unchanged 
with probability 50%. This is illustrated in Figure]^ Gx could be called a 
“crowd avoidance” rule, since agents tend to avoid being surrounded by other 
agents, and tend to move into cells which have empty neighbours. 
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Using the notation introduced earlier, we can define Gx : S ^ S as 




1 

0 




1 

0 


if Ujj- 


if u,j- 


if Ujj 
if Ujj- 


ir 'k 'k 

0 0 0 

1 ★ 
0 0 0 

:Ar 1 ★ 

ir 'k -k 

k: k: -k 

0 * 
1 1 1 

★ 0 * 

1 1 1 

kr k k 


otherwise. 


and Xij-i = 1, 


and Xi j = 1, 


and Xij-i = 1, 


and Xij = 1, 


(12) 


The above rule can be written in an algebraic form similarly as done for rule 
Fx in eq. ([^. Then one can write equation analogous to eq. (j^ and show that 
Gx is number-conserving, by shifting dummy indices in the same fashion as 
we did for rule Fx ■ We omit these details since they are rather straightforward 
but tedious. Moreover, Figurej^makes it clear that the number of agents must 
stay constant. 

Rule GxR 232 does exactly what is needed: it grows clusters of zeros or 
ones, but it also eliminates rows of the type ... 01010101 .... 


6 CLASSIFICATION OF DENSITIES 

The density classification can now be performed by 

{GxR232fKFxRl84f\ (13) 

where, obviously, we need to use different random field X each time step. 
More precisely, we define 

(Gxf?232)^^(.Px^l84)^^ = {Fx<-^i+'^ 2)R232) ■ ■ ■ {Fxc^i+^i R' 232 ) 
{Fx(t-i+i)R 232 ) {FxiTi)Ri^i)... {FxinRisi) {FxmRisi) 

(14) 

where X^^\X ^^\... are independent identically distributed ran¬ 

dom arrays, each consisting of random variables X^*j with probability distri¬ 
bution described by eq. 0. To simplify our classification scheme, we will 
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take Ti = T 2 , even though this is not the most efficient choice (typically, T 2 
does not need to be as large as Ti). 

Let 0 denote an L x L array of zeros, and 1 an L x L array of ones. We 
propose the following conjecture. 

Conjecture. Let s be an L x L array of binary numbers. If the number of 
ones in s is greater than the number of zeros, then, with probability 1, 

lim {GxR232f {FxRi&if {s) = 1. (15) 

T —>00 

Similarly, if the number of ones in s is smaller than the number of zeros, then, 
with probability 1, 

lim {GxR 232 f {FxRisif {^) = 0. (16) 

T —>00 

In order to provide numerical evidence supporting this conjecture, we will 
evaluate performance of the rule {GxR 2 d, 2 )^{FxRisi)^ in classifying den¬ 
sities. Of course, we won’t be able to perform infinitely many iterations. 
Nevertheless, as we will see in the next section, the performance can be very 
good even if a finite T is used. 

7 PERFORMANCE 

Performance of a given rule in performing the density classification is typ¬ 
ically defined as follows. Let / denotes the number of random initial con¬ 
figurations consisting of N cells each, drawn from a symmetric Bernoulli 
distribution. This means that each initial configuration is generated by setting 
each of its cells independently to 0 or 1, with the same probability 1/2. If the 
resulting configuration has exactly the same number of zeros and ones, we 
flip one randomly selected bit to break the symmetry. 

Suppose that we iterate the rule T' on each of those initial configurations. If 
a configuration with initial density less than 0.5 converges to 0, we consider it 
a successful classification, similarly as when a configuration with initial den¬ 
sity greater than 0.5 converges to 1. In all other cases we consider the classi¬ 
fication unsuccessful. The percentage of successful classifications among all 
/ initial conditions will be called performance of the rule fir. In Table 1, we 
show the performance for I = 1000 and for two array sizes, = 50 x 50 
and N = 100 X 100. In order to show how the performance depends on T, 
we used different values of T ranging from 250 to 16000. One can see that 
for L = 50, 4000 iterations of each rule suffice to obtain the perfect perfor¬ 
mance. For L = 100, 16000 iterations are needed. This means that one needs 
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T 

performance for 
L = 50 

performance for 

L = 100 

250 

85.5% 

39.9% 

500 

96.1% 

75.9% 

1000 

99.4% 

90.3% 

2000 

99.7% 

97.5% 

4000 

100 % 

99.7% 

8000 

100 % 

99.9% 

16000 

100 % 

100 % 


TABLE 1 

Performance of {GxR 232 )^{FxRimY^ for I = 100 and for lattices 50 x 50 and 
100 X 100 for different values of T. 


only slightly more than three iterations per bit in both of those cases. This 
is understandable if one considers the “diffusive” nature of Fx and Gx- In 
FxRiBAi for example, pairs 00 perform a sort of random walk until they hit 
a pair 11. The required number of iterations of FxRisa, therefore, should be 
strongly cori'elated with the average hitting time for random walk. It is known 
that the average hitting time for a random walk on 2D periodic lattice roughly 
scales as ii, and, since the number of bits in the lattice also scales as 
we expect the the number of iterations per bit should remain roughly constant 
as L increases. 

We should also remark that although we took the number of iterations to 
be the same for both rules FxRisa and GxR 232 , this is not really necessary. 
In fact, the largest number of iterations of GxR 232 needed to converge to 
0 or 1 observed in our numerical simulations was 611 for 100 x 100 lattice 
and 265 for 50 x 50 lattice. Since clusters of zeros or ones in rule 232 grow 
linearly with time, and they grow only in horizontal direction, one can expect 
that the number of required iterations of GxR 232 grows linearly with L. 

8 CONCLUSIONS AND FURTHER WORK 

We presented construction of a probabilistic two-rule scheme which performs 
density classihcation in two dimensions with increasing accuracy as the num¬ 
ber of iterations increases. Although right now this scheme remains a conjec¬ 
ture, numerical evidence strongly supports it. Moreover, since the dynamics 
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of the classification process is rather well understood, it is quite likely that a 
formal proof of this conjecture may be within reach. It should also be possible 
to obtain some rigorous bounds on the expected number of iterations needed 
for classihcation. 

A crucial feature of the proposed solution is that the agents at some point 
of time change their behaviour from traffic coupled with lane changing to 
proliferation/death coupled with crowd avoidance. Could one devise a sin¬ 
gle rule achieving similar performance? In the one-dimensional case, this 
has been achieved, as the probabilistic solution proposed in il is actually a 
stochastic “mixture” of rules 184 and 232. The author plans to pursue this 
idea in the near future. 

One should also add that if one follows the spirit of the “classical” DCP, 
the agents should have no access to any global information. Since in the 
proposed solution they change their behaviour after T iterations, one could 
argue that they have access to a global timer. At the same time, the agents 
follow very simple rules, which makes it possible to describe their dynamics 
as CA. This brings an interesting question: if one denied the agents the access 
to the global timer, but instead equipped them with more sophisticated rules 
of behaviour (e.g., giving them local memory, making them “smarter”, etc.), 
could the DCP be solved? A recent progress on solving DCP using CA with 
memory m indicates that this could be a promising avenue to pursue. On 
might, therefore, ask a broader and more general question: what are the least- 
complex agents which could solve the DCP? The author hopes that this article 
stimulates further research in this direction. 
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